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ABSTRACT 

This  paper  describes  and  illustrates  a  system  which 
incorporates  a  computerized  grid  mesh  technique  for  determining 
the  Thiessen  weighting  factors  to  estimate  areal  averages  from 
point  data.   The  set  of  procedures  was  developed  in  conjunction 
with  research  on  the  estimation  of  large  area  crop  yields  from 
weather  and  soils  data.    The  system  requires  the  latitudes  and 
longitudes  of  c limatological  stations,  and  boundary  coordinates 
obtained  from  digitizing  a  map  showing  the  crop  reporting 
districts.   The  package  of  computer  programs  includes 
transformations  which  reduce  climat ological  station  coordinates 
and  digitized  crop  district  boundaries  to  a  common  rectangular 
coordinate  system.   The  effectiveness  of  the  procedures  is 
demonstrated  numerically,  and  graphically  through  the  use  of  the 
SYMAP  computer  mapping  program,  for  crop  reporting  districts  in 
the  province  of  Saskatchewan.   The  application  of  the  computed 
weighting  factors  is  illustrated  for  the  June  1978  rainfall. 
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1.   INTRODUCTION 

The  Thiessen  polygon  method  for  estimating  areal  data  from 
point  observations  has  several  applications  in  meteorology  and 
geography  ( Rhy ns burg er ,  1973).   The  concept  is  quite  simple, 
but  adequate  for  many  purposes:   it  assumes  that  the  data  value 
anywhere  can  be  considered  to  be  the  same  as  that  at  the 
nearest  observation  point.   If,  for  example,  the  monthly 
rainfall  of  a  district  is  to  be  estimated  for  a  number  of 
months,  and  65%  of  that  district's  area  is  closest  to  weather 
station  A  and  35%  to 

station  B,  the  formula  used  is  Rp  =  .65R^  +  .35Rg, 
where  RD  is  the  district  rainfall  for  a  month  and  R^ 
and  Rg  are  the  station  rainfall  values  for  that  month. 
However,  in  a  computer  analysis,  the  manual  geometric 
construction  and  planimetric  methods  that  are  normally 
used  to  partition  the  district  and  compute  the  weighting 
factors  can  be  quite  inconvenient. 

The  authors  have  been  involved  in  large  area  crop  yield 
analyses  (Williams,  Joynt  and  McCormick,  1975;  Sheppard  and 
Williams,  1976)  which  required  reducing  weather  and  soils  data 
to  a  crop  district  basis  so  that  they  could  be  related  to  the 
district  yields.   Sometimes  for  such  work,  weather  data  that 
may  be  available  on  the  basis  of  districts  that  at  least 
approximately  correspond  to  the  crop  districts  can  be  used,  as 
was  the  case  in  the  United  States  spring  wheat  analyses  of 
Starr  and  Kostrow  (1978).   In  other  cases  the  data  may  not  be 
available  on  this  basis,  and  even  if  they  are  there  may  be 
reasons  for  using  some  particular  set  of  climat ological 
stations,  especially  where  operational  yield  prediction  is 
involved . 

In  preparing  the  weather  data  in  the  regional  yield 
analyses  in  which  we  were  involved,  the  Thiessen  polygon 
method  was  used  to  obtain  district  values  from  the  point  data 
(Williams  and  Robertson,  1965;  Newton-Smith  and  Williams, 
1964).   Because  of  the  work  involved  in  manually  re-determi- 
ning weighting  factors  for  different  sets  of  stations,  the  set 
of  precipitation  stations  that  had  been  available  during  the 
initial  phases  of  the  work  in  1962  continued  to  be  used  until 
1975,  even  though  this  meant  that  full  advantage  was  not  being 
taken  of  the  weather  data  that  were  later  available.   Problems 
also  arose  whenever  there  were  closures  or  discontinuities  in 
records  among  the  fixed  set  of  stations. 
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Further  crop  forecasting  work  (Williams  and  Lattimore, 
1978)  required  the  use  of  a  different  set  of  stations  (Fig.  1) 
and  the  possibility  of  employing  a  program  developed  by  Louie 
(1977)  to  compute  the  required  Thiessen  weighting  factors  was 
investigated.   This  program  requires  station  and  boundary 
locations  to  be  provided  in  a  rectangular  Cartesian  coordinate 
system,  whereas  station  locations  are  normally  given  in  terms 
of  latitudes  and  longitudes.   These  can  be  converted  to  a 
rectangular  system,  but  it  is  not  convenient  or  desirable  to 
convert  them  to  the  same  system  as  used  in  digitizing  the 
boundaries,  and  different  parts  of  the  boundary  set  may 
themselves  be  in  different  systems  if  they  were  digitized  at 
different  times.   Another  consideration  here  is  that  a 
strictly  computational  approach  to  determining  Thiessen 
weighting  factors  generally  would  preclude  the  possibility  of 
visual  inspection  which  could  otherwise  be  helpful  in  making  a 
final  selection  of  stations  to  be  used. 

We  have  therefore  developed  procedures  (Fig.  2)  which  take 
area  boundaries  as  digitized  from  a  map,  and  the  latitudes  and 
longitudes  of  the  stations  to  be  used,  and  compute  the 
required  Thiessen  weighting  factors.   The  procedures  also 
provide  computer  produced  maps  representing  the  polygons  and 
weighting  factors  if  desired.   The  system  incorporates  the 
Thiessen  weighting  factor  program  of  Louie  (1977),  together 
with  several  other  programs.   The  purpose  of  this  paper  is  to 
describe  this  system  or  set  of  procedures  and  illustrate  its 
application. 

2.   DESCRIPTION  OF  THE  SYSTEM 


a_»       Transformation  of  coordinates 

The  latitudes  and  longitudes  of  all  c limatological 
stations  in  Canada  are  readily  available.   In 

agrometeorological  studies  station  data  are  often  plotted  for 
overlaying  on  maps  using  the  Lambert  Conformal  Conic 
projection  with  standard  parallels  at  49°  and  77°N.    This 
projection  gives  good  directional  and  shape  relationships  for 
a  country  such  as  Canada,  which  has  a  great  east-west  extent, 
and  it  is  commonly  used  in  maps  showing  all  of  Canada  or  large 
parts  of  it. 
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SOURCE  DATA: 
Latitudes  and  longitudes  of  stations 
including  2  or  3  reference  points 


Use  Computer  routine  to  derive 
standard  coordinates. 


DATA  SET  A: 
Coordinates  of  stations  and  the  2 
or  3  reference  points  in  standard 
coordinates 


Use  computer  grid  mesh  routine  and 
data  sets  A  and  C  to  derive  Thiessen 
weighting  factors 


DATA  SET  D: 
Set  of  Thiessen  weighting  factors  for 
crop  districts 


Apply  SYMAP  using  proximal  map 
procedures  and  the  four  colour 
principle  to  map  Thiessen  polygons 


Perform  a  visual  validation  of 
Thiessen  weighting  factors 


SOURCE  DATA: 
Map  of  boundaries  on  Lambert 
Conformal  Conic  projection 


Prepare  boundary  map 
overlay. 


Digitize  boundaries  and  the  2  or  3 
reference  points. 


DATA  SET  B: 
Coordinates  of  digitized  closed 
boundaries  and  reference  points. 


Use  Computer  routine  to  transform 
digitized  coordinates  to  standard 
using  the  2  or  3  reference  points 
from  data  sets  A  and  B. 


DATA  SET  C: 
Set  of  district  boundaries  in  standard 
coordinates 


Using  SYMAP  prepare  map  showing 
polygons  and  district  boundaries 


\/ 


Fig.  2     Flow  chart  of  the  system  used  to  calculate  and  display  graphically 
the  Thiessen  weighting  factors. 
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A  computer  subroutine  obtained  from  the  Canada  Department  of 
Energy,  Mines  and  Resources  is  used  to  convert  the  geographical 
coordinates  in  degrees  and  minutes  to  X  and  Y  values  in  a 
rectangular  Cartesian  coordinate  system.   The  authors  have 
adopted  a  system,  to  be  referred  to  as  the  standard  coordinate 
system  throughout  the  remainder  of  this  paper,  in  which  the  95°W 
meridian  of  longitudes  is  vertical.   The  X  value  is  the  distance 
to  the  right,  and  Y  the  distance  up,  from  an  origin  533  mm  to 
the  left  of  the  41°N  95°W  point  on  a  1:5,000,000  scale  map. 
Thus  Y  measured  toward  the  top  edge  of  the  map  along  the  95th 
meridian  represents  distance  northward. 

The  basic  requirements  with  regard  to  data  for  computing 
Thiessen  weighting  factors,  or  mapping  the  polygons  by  computer 
as  described  here,  are  a  set  of  latitudes  and  longitudes  of 
stations  and  a  source  map  showing  the  district  boundaries. 
There  should  be  some  reference  points  on  the  source  map, 
preferably  more  than  two,  for  which  the  latitudes  and  longitudes 
are  known. 

The  district  boundary  map  must  be  digitized  to  provide  the 
boundary  information  for  input  to  the  computer  if  this  has  not 
already  been  done.   This  involves  recording  X  and  Y  coordinates 
for  points  along  each  boundary,  and  also  for  each  reference 
point.   This  is  usually  done  on  a  digitizing  table  equipped  to 
record  the  coordinates  on  tape  or  cards  as  the  operator  moves  a 
cursor  along  the  line  being  digitized,  although  for  small  jobs 
it  can  be  done  manually  with  the  aid  of  graph  paper.   The  units 
of  measurement  employed  for  the  coordinates  normally  depend  on 
the  equipment  used,  and  the  positioning  of  the  axes  is 
determined  by  convenience  except  that  we  recommend  placing  the 
west  edge  of  the  map  at  the  top  for  digitizing. 

Using  the  same  conversion  routine  as  in  the  case  of  the 
climatological  stations,  the  latitudes  and  longitudes  of  the 
reference  points  are  converted  to  the  standard  coordinate 
system.    The  digitized  boundary  coordinate  data  are  then 
transformed  by  linear  transformation  procedures  to  the  standard 
coordinate  system  as  used  for  the  station  locations. 


The  computer  software  to  quantify  the  linear  transformation 
and  then  to  apply  it  to  the  digitized  coordinate  data  is 
combined  in  one  computer  program.   Firstly,  the  coordinates  of 
the  reference  points  in  the  standard  and  in  the  digitized 
coordinate  system  are  read  into  the  computer  and  the  parameters 
of  the  linear  transformation  are  established  by  solving  a  system 
of  linear  equations.   The  digitized  coordinate  data  are  then 
read  and  transformed  to  the  standard  coordinate  system  using  the 
appropriate  transformation  matrix.    We  will  describe  in  more 
detail  how  this  is  carried  out  for  the  case  where  three 
reference  points  are  used  and  for  the  special  case  where  only 
two  are  available. 
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Let  XS(K),  YS(K)  denote  the  coordinates  of  the  Kth  point  in 
the  standard  coordinate  system  and  XD(K),  YD(K)  denote  the 
coordinates  of  the  same  point  in  the  digitized  coordinate 
system.   The  linear  transformation  relating  these  two 
coordinates  can  be  expressed  in  matrix  notation  as: 
*XS(K) 


A(l,l) 

A(l,2) 

XD(K) 

-L 

B(l) 

A(2,l) 

A(2,2) 

YD(K) 

T 

B(2) 

YS(K) 


We  note  that  in  (1)  there  are 
A(I,J)'s  and  the  two  B(I)'s. 


(1) 


six  unknowns,  namely,  the  four 


For  the  three-reference  point  case  we  denote  the  coordinates 
of  the  reference  points  in  the  standard  coordinate  system  by 
(XRS(K),  YRS(K))  K  =  1,2,3  and  in  the  digitizing  system,  by 
(XRD(K),  YRD(K))  K  =  1,2,3.   From  equation  (1)  the  following  two 
matrix  equations  are  formed: 


XRS(l) 

XRS(2) 

= 

XRS(3) 

XRD(l) 
XRD(2) 
XRD(3) 


YRD(l) 
YRD(2) 
YRD(3) 


B(l) 

A(l,l) 

A(l,2) 


(2) 


and 


YRS(l) 

1 

XRD(l) 

YRD(l) 

B(2) 

YRS(2) 

= 

1 

XRD ( 2  ) 

YRD(2) 

A(2,l) 

YRS(3\ 

1 

XRD(3) 

YRD(3) 

,A(2,2) 

(3) 


The  computer  system  reads  in  the  digitized  and  standard 
coordinates  of  the  reference  points  and  solves  these  two 
equations  for  the  A(I,J)'s  and  the  B(I)'s  using  Gaussian 
elimination  with  partial  pivoting  (Dahlquist  et  al. ,  1974).   The 
digitized  coordinate  data  are  then  read  and  transformed  into  the 
standard  coordinate  system  using  equation  (1). 


An  analysis  of  the  nature  of  the  transformation  required  to 
relate  the  digitizing  table  coordinate  system  to  the  standard 
system  suggests  that  there  would  normally  be  valid  assumptions 
that  could  reduce  the  number  of  parameters  to  be  estimated  from 
six  to  four.   This  in  turn  reduces  the  number  of  reference 
points  required  from  three  or  two.   The  assumptions  are  that  the 
coordinate  axes  on  the  digitizing  table  can  be  transformed  to 
the  standard  axes  by  performing  a  rigid  rotation,  a  translation 
and  an  identical  scale  change  in  both  the  X  and  Y  coordinates. 

To  motivate  the  restraints  that  these  assumptions  put  on  the 
matrix  A  in  equation  (1),  consider  the  rigid  rotation  of  two 
perpendicular  axes  through  an  angle  6.   Take  a  fixed  point  with 
coordinates,  (X(l),  Y(l))  in  the  initial  coordinate  system  and 
(x(l)>  y(l))  in  the  new  system.   Using  simple  trigonometric 
relations  we  can  specify  the  transformation  with: 
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x(l) 

y(DJ 


cos9    sin6 
■sinS   cose 


x(i) 

LY(1)J 


(4) 


XS(K) 

A(1,D 

A(l,2) 

XD(K) 

_l_ 

B(l) 

_YS(K) 

-A(l,2) 

A(l,l)_ 

YD(K)_ 

T^ 

,B(2) 

If  a  translation  and  a  scale  change  identical  in  both  directions 
is  combined  with  the  above  rotation,  we  obtain  the  general 
relat  ion 


(5) 


Comparing  (1)  with  (5)  we  note  that,  as  in  (4),  A(2,l)  =  -A(l,2) 
and  A(2,2)  =  A(l,l).   For  this  system  only  four  parameters 
require  estimation  to  specify  the  transformation.   To  evaluate 
the  four  parameters  our  computer  system  reads  in  (XRS(K), 
YRS(K))  and  (XRD(K),  YRD(K))  for  K  =  1,2.   It  then  forms  from 
equation  (5)  the  matrix  equation 


'XRS(l)" 

XRS(2) 

YRS(l) 

— 

JRS(2). 

1 

0 

XRD(l) 

1 

0 

XRD(2) 

0 

1 

YRD(l) 

0 

1 

YRD(2) 

YRD(l)" 

YRD(2) 

XRD(l) 

XRD(2)_ 

B(l) 

* 

B(2) 

A(l, 

1) 

A(l, 

2)J 

(6) 


Equation  (6)  is  solved  for  A(l,l),  A(l,2),  B(l)  and  B(2)  using 
the  same  Gaussian  elimination  routine.   The  digitized  coordinate 
data  then  can  be  transformed  into  the  standard  coordinate  system 
using  equation  (5). 

_b.   Weighting  factor  computations 

After  the  locations  of  the  district  boundaries  and 
meteorological  stations  have  been  transformed  to  a  standard 
rectangular  coordinate  system,  they  are  ready  for  use  as  input 
to  the  program  to  calculate  the  Thiessen  weighting  factors. 

Rather  than  working  analytically  with  polygons,  we  adopted 
the  grid  mesh  technique  developed  by  Louie  (1977).   This 
technique  requires  that  the  boundary  coordinates  for  a  district 
be  read  in  a  clockwise  order  and  that  all  the  station 
coordinates  be  entered.   The  computer  program  then  establishes  a 
rectangular  grid  which  encloses  the  district.   Grid  points  lying 
within  the  district  are  systematically  checked  to  see  which 
station  they  are  closest  to.    The  proportion  of  points  in  the 
district  closest  to  a  station  determines  that  station's  Thiessen 
weighting  factor. 

Required  levels  of  accuracy  are  achieved  by  iteratively 
refining  the  grid  mesh.  Louie  (1977)  tested  the  method  and 
provided  an  illustration  of  the  one-district  case. 
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Our  type  of  application  involves  a  large  region  which  is 
divided  into   number  of  districts,  with  stations  scattered 
across  the  region  (Fig.  1).   Our  initial  approach  was  to  only 
enter  the  stations  with  each  district  that  could  reasonably  be 
expected  to  have  a  non-zero  Thiessen  weighting  factor.   After 
testing  the  procedure  where  all  the  stations  were  included  with 
each  district  it  was  found  that  the  routine  was  capable  of 
quickly  eliminating  every  station  with  a  zero  Thiessen 
coefficient.   Our  system  is  therefore  designed  to  read  in  the 
coordinates  of  all  the  available  stations.   It  then  reads  in  the 
digitized  boundary  and  performs  the  Thiessen  calculations  for 
each  district  in  turn. 

The  number  of  iterations  per  district  required  by  the  grid 
mesh  technique  is  controlled  by  the  rate  of  convergence  and  the 
level  of  accuracy  specified  by  the  user.   In  addition,  we 
specify  that  there  should  be  a  minimum  of  four  and  a  maximum  of 
seven  iterations.   The  reason  for  using  a  minimum  of  four 
iterations  is  that  we  observed  cases  where,  for  the  coarse  grids 
used  in  the  first  few  iterations,  a  station  could  be  assigned  a 
zero  Thiessen  weight,  but  the  weight  subsequently  became 
non-zero  . 

When  the  grid  mesh  technique  was  applied  in  preliminary 
testing  to  the  42  crop  districts  and  65  precipitation  stations 
that  had  been  used  in  previous  analyses,  the  computer  Thiessen 
weighting  factors  corresponded  quite  well  to  those  that  had  been 
determined  earlier  by  much  more  laborious  methods. 

c  .   Visual  representation 

The  standard  coordinates  of  the  stations  and  boundaries 
prepared  for  use  in  the  computation  of  the  Thiessen  weighting 
factors  can  readily  be  employed  as  input  for  the  SYMAP  computer 
mapping  program  (Dougenik  and  Sheehan,  1976).   This  program 
utilizes  the  standard  computer  printer  to  prepare  maps,  using 
combinations  of  printed  and  overprinted  characters  to  produce 
patterns.   Other  applications  of  this  program  in  agrometeorology 
have  been  demonstrated  by  Williams  and  Sharp  (1972)  and  by 
Williams,  McKenzie  and  Sheppard  (1980). 

The  proximal  map  option  of  SYMAP  is  well  adapted  to  mapping 

Thiessen  polygons.   With  it  one  can  use  different  patterns  for 

different  polygons,  and  the  edges  between  polygons  may  be  left 

as  white  lines,  or  they  can  be  lines  of  printed  symbols. 

Alternatively,  the  polygons  can  be  left  white  and  only  their 
outlines  shown  (Fig.  3). 

The  boundaries  of  the  districts  can  be  depicted  at  the  same 
time  by  using  the  "Otolegend"  package  of  SYMAP  (Fig.  4).   Thus  a 
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Fig.  3  Map  of  Saskatchewan  crop  growing  area  prepared  with 
SYMAP,  showing  Thiessen  polygons  for  c limat ological 
stations  numbered  as  in  Table  1. 
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Fig.  4    SYMAP  map  showing  Saskatchewan  crop  districts  overlaid 
on  the  Thiessen  polygons. 
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visual  representation  can  be  provided  of  the  proportions  of  each 
district  that  are  in  various  polygons,  i.e.  the  computer 
Thiessen  weighting  factors  are  represented  by  relative  areas  on 
the  map  that  is  printed  out  using  SYMAP. 

In  mapping  with  colours  one  could  use  only  four  colours  to 
prepare  a  map  of  polygons  such  that  no  edge  between  adjacent 
polygons  had  the  same  colour  on  both  sides.   Similarly,  a 
"four-level  principle"  applies  with  SYMAP;  the  program  normally 
stratifies  the  data  into  several  classes  or  levels  and  applies  a 
different  pattern  to  each,  and  as  few  as  four  levels  can  be  used 
to  show  all  the  polygons  distinctly  in  mapping  Thiessen  polygons 
with  SYMAP  (Fig.  4).   To  do  this  one  employs  a  set  of  dummy 
input  data  such  that  no  two  adjacent  polygons  will  have  the  same 
level . 

3.   THIESSEN  POLYGON  COMPUTATIONS  FOR  SASKATCHEWAN  CROP  DISTRICTS 

«i  •   Determining  the  Thiessen  weighting  factors 
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Eq.  (2)  and  (3)  were  solved  to  obtain  the  A(I,J)'s  and 
B(I)'s  using  the  Gaussian  elimination  routine,  and  (1)  was  then 
applied  to  transforming  the  boundary  coordinates  to  the  standard 
system.   For  example,  using  the  computed  values  of  A(l,l), 
A(l,2),  and  B(l),  for  boundary  point  (5123,19662)  the  new 
X-value  is  XS  =  .0003955(5123)  +  .00006452(19662)  +  7.554  = 
10.85. 

The  standard  coordinates  for  the  stations  were  determined 
from  the  latitudes  and  longitudes.   For  example  for  station  14, 
Saskatoon  (52°10f  N,  106°41'  W),  use  of  the  appropriate  computer 
subroutine  yielded  the  standard  coordinates  (XS,  YS)  =  (14.81, 
10.45)  (Table  1). 


The  grid  mesh  program  was  then  applied  to  the  station  and 
boundary  standard  coordinates  to  determine  the  Thiessen 
weighting  factors.   Examples  are  given  here  (Table  2)  for 
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TABLE  1.  Cllaatologlcal  station*  with  their 
and  X-T  coordinates  derived  lor  a 


number*,  latitudes,  longitudes, 
Lasbert  confornal  conic  projection. 


Station 

Number 

Latitude 

Longitude 

X 

T 

Lathbrldge 

1 

49°38' 

1120481 

11.03 

9.09 

Medicine  Bat 

2 

50°1' 

110°43' 

12.25 

9.10 

Brooks 

3 

50°35« 

111054' 

11,74 

9.75 

Calgary 

4 

5lV 

U4°l' 

10.75 

10.50 

Red  Deer 

5 

52°11' 

113°54' 

11.08 

11.38 

Coronation 

6 

52°6' 

111°27' 

12.32 

10.96 

Veralllon 

7 

53°21« 

110O50, 

12.90 

11.92 

Edmonton 

8 

53°19' 

113°35* 

11.53 

12.27 

Cold  Lake 

9 

54°25' 

110°17' 

13.40 

12.75 

Meadow  Lake 

10 

54°7' 

108°26' 

14.25 

12.29 

Nipavln 

11 

53°20* 

104°0* 

16.36 

11.22 

Prince  Albert 

12 

53°13« 

105°41' 

15.49 

11.26 

North  Battleford 

13 

52°46' 

108°15' 

14.10 

11.13 

Saskatoon 

14 

52°10' 

106°41' 

14.81 

10.45 

Hudson  Bay 

15 

52°52' 

102°24' 

17.14 

10.72 

Wynyard 

16 

51°46' 

104°12* 

16.07 

9.90 

Torkton 

17 

51°16' 

102°28' 

16.94 

9.34 

Klndersley 

18 

51°28' 

109°9* 

13.39 

10.13 

Swift  Current 

19 

50°17« 

107°41' 

13.96 

8.95 

Moose  Jaw 

20 

50°20' 

105°33' 

15.14 

8.78 

Reglna 

21 

50°26' 

104°40' 

15.64 

8.79 

Broadview 

22 

50O23' 

102°35' 

16.78 

8.59 

Rock  Clen 

23 

49°10' 

105°58' 

14.74 

7.81 

Estevan 

24 

49°«' 

103°0' 

16.41 

7.47 

Brandon 

25 

49°52* 

99°58* 

18.21 

8.04 

Pilot  Mound 

26 

«9°12' 

98°53' 

18.77 

7.37 

Portage  La  Prairie 

27 

49°54' 

98°16* 

19.16 

7.96 

Winnipeg 

28 

49°54« 

97°14« 

19.74 

7.94 

Clmll 

29 

50°37' 

96°59* 

19.90 

8.56 

Dauphin 

30 

51°6' 

100°3' 

18.24 

9.07 

The  Pas 

31 

53°58' 

10lV 

17.90 

11.59 

La  combe 

32 

S2°28' 

113°45' 

11.23 

11.60 

Indian  Head 

33 

50°32* 

103°40' 

16.20 

8.79 

Scott 

34 

52°22' 

108°50' 

13.72 

10.85 

Melfort 

35 

52°49' 

104°36' 

15.99 

10.83 

Morden 

36 

«9°n' 

98°5' 

19.23 

7.33 
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TABLE  2.   Sample  results  for  two  crop  districts  of  the  iterative  calculation  of  Thiessen  weighting  factors 


4a 


2b 


Iteration 

Total  Number  of 
Grid  Points 

Grid  Points 
Within  Boundary 

74 

Station  Numbers  with 
as  Percentage 

Factors  Exp 
of  Area 

ressed 

1 

228 

2 

59.5 

19 

40.5 

2 

833 

295 

2 

59.7 

19 

40.3 

3 

3,122 

1,182 

2 

58.7 

19 

41.3 

4 

12,019 

4,734 

2 

58.6 

19 

41.4 

1 

96 

29 

20 

31.0 

21 

34.5 

22 

0 

33 

34.5 

2 

327 

131 

20 

30.5 

21 

36.6 

22 

0 

33 

32.8 

3 

1,167 

531 

20 

29.8 

21 

35.8 

22 

0 

33 

34.5 

4 

4,406 

2,126 

20 

30.3 

21 

36.0 

22 

0 

33 

33.7 

5 

17,042 

8,537 

20 

30.7 

21 

36.0 

22 

0. 

1 

33 

33.2 

6 

66,791 

34,179 

20 

30.7 

21 

36.2 

22 

0 

1 

33 

33.1 
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TABLE  3.   Thiessen  weighting  factors  and  stations  for  Saskatchewan 
crop  districts. 


District 

la 

24 

.8856 

22 

.0944 

25 

.0200 

lb 

22 

.9564 

33 

.0405 

25 

.0030 

2a 

21 

.3738 

24 

.3518 

33 

.1884 

20 

.0628 

22 

.0167 

23 

.0065 

2b 

21 

.3622 

33 

.3306 

20 

.3067 

22 

.0006 

3as 

23 

.7810 

24 

.1627 

20 

.0434 

21 

.0129 

3an 

20 

.7931 

19 

.1810 

23 

.0259 

3b  s 

19 

.7443 

23 

.2557 

3bn 

19 

.8991 

18 

.0790 

14 

.0219 

4a 

2 

.5858 

19 

.4142 

4b 

18 

.3744 

19 

.3252 

2 

.3004 

5a 

17 
16 

.4351 
.1013 

33 

21 

.2486 
.0303 

22 

.1842 

5b 

16 

.5133 

17 

.4072 

15 

.0562 

35 

.0234 

6a 

16 

.3784 

14 

.2598 

21 

.1896 

20 

.1721 

6b 

14 

.8806 

20 

.0905 

13 

.0145 

18 

.0084 

12 

.0053 

19 

.0007 

7a 

18 

.9453 

34 

.0324 

14 

.0223 

7b 

34 

.8111 

13 

.1146 

18 

.0551 

14 

.0193 

8a 

15 

.3622 

11 

.3245 

35 

.3123 

16 

.0010 

8b 

35 

.4921 

14 

.2365 

12 

.2346 

16 

.0369 

9a 

12 

.4087 

13 

.2740 

10 

.1293 

11 

.1159 

14 

.0663 

35 

.0056 

9b 

10 
34 

.3961 
.0851 

13 

.2423 

7 

.1403 

9 

.1362 
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TABLE  4.   June  1978  rainfall  for  all  stations,  and  for  Saskatchewan 

districts  computed  from  the  station  data  using  the  weighting 
factors  from  Table  3. 


Station 

Rain  in  mm 

District 

Rain  in  mm 

1 

17.6 

la 

111.2 

2 

53.4 

3 

60.7 

lb 

34.8 

4 

59.6 

5 

112.6 

2a 

80.1 

6 

19.3 

7 

39.5 

2b 

56.4 

8 

40.6 

9 

15.6 

3as 

84.6 

10 

31.7 

11 

115.2 

3an 

50.6 

12 

124.6 

13 

45.7 

3bs 

64.5 

14 

39.7 

15 

56.3 

3bn 

56.1 

16 

68.2 

17 

55.0 

4a 

55.9 

18 

22.6 

19 

59.4 

4b 

43.8 

20 

47.7 

21 

56.8 

5a 

54.7 

22 

33.6 

23 

79.4 

5b 

63.1 

24 

121.3 

25 

28.7 

6a 

55.1 

26 

81.9 

27 

40.5 

6b 

40.8 

28 

48.6 

29 

49.0 

7a 

24.0 

30 

118.8 

31 

40.4 

7b 

52.0 

32 

62.9 

33 

64.2 

8a 

91.8 

34 

55.2 

35 

108.8 

8b 

94.7 

36 

11.6 

9a 

84.2 

9b 

36.0 
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two  districts.   For  Saskatchewan  District  4a,  it  was  clear  from 
the  first  that  only  stations  2  and  19  were  involved,  and  by  the 
fourth  iteration  the  results  had  converged  and  indicated  that 
58.6%  of  the  district  area  was  closest  to  station  2  and  41.4%  to 
station  19.   For  District  2b,  at  first  only  stations  20,  21  and 
33  were  involved  but  after  the  fourth  iteration  station  22  was 
also  included,  although  only  a  very  small  part  of  the  area  was 
closest  to  that  station. 

To  provide  a  visual  representation,  the  SYMAP  program  was 
used  to  map  polygons  showing  the  areas  closest  to  the  various 
stations  (Fig.  3).   A  crop  district  map  was  then  overlaid  on 
this  and  for  each  district,  the  portion  of  the  area  of  each 
polygon  that  fell  within  the  district  was  inspected  and  the 
corresponding  final  weighting  factor  from  the  grid  mesh 
calculations  (Table  3)  was  checked  for  reasonableness.   This 
visual  inspection  also  helped  us  to  assess  the  adequacy  of 
spatial  representation.   For  example  District  2b,  with  three 
stations  well  distributed  within  the  district,  is  quite 
adequately  represented  for  purposes  such  as  large  area  yield 
analysis.   In  contrast  District  4a  is  covered  by  the  polygons  of 
two  stations  some  distance  outside  the  district,  to  the 
northwest  and  northeast,  and  is  therefore  not  well  represented. 

_b«   Applying  the  weighting  factors  to  computing  district  rainfall 

The  application  of  the  weighting  factors  is  here  illustrated 
by  an  example  taken  from  the  1978  yield  forecast  computations 
carried  out  in  Agriculture  Canada.   These  computations  included 
the  transformation  of  rainfall  data  from  a  station  to  a  district 
basis  (Table  4).   For  example,  the  June  1978  District  4a 
rainfall  was  computed  from  the  amounts  for  stations  2  and  19 
as:   .586(53.4)  +  .414(59.4)  =  55.9  mm. 

A  visual  representation  of  the  station  data  on  a  polygon 
basis  (Fig.  5a)  appears  quite  different  from  that  of  the  same 
data  using  isolines  (Fig.  5b).   The  isolines  give  a  more 
realistic  visual  representation  of  the  spatial  patterns  of  the 
rainfall,  but  for  the  purpose  of  computation  of  district 
rainfall,  it  is  much  more  convenient  to  use  the  polygon 
approach . 

Applying  the  Thiessen  weighting  factors  in  computing  the 
district  values  in  effect  spatially  smoothes  the  data,  so  the 
district  data  tend  to  be  less  variable  than  are  the  station  data 
on  which  they  are  based,  even  when  the  number  of  districts  is 
relatively  large.   For  example,  in  the  northern  part  of  the 
Saskatchewan  agricultural  region,  of  the  four  rainfall  classes 
only  the  lowest  three  appear  in  the  district  data  (Fig.  5c),  but 
in  the  mapping  based  on  station  data  all  four  of  the  levels, 
incuding  the  extreme  highest  (darkest  shading)  appear  in  the 
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northern  area.   (Figures  5a  and  b) .   The  sample  standard 
deviation  for  the  20  Saskatchewan  districts  is  22.7  mm,  while 
for  the  18  Saskatchewan  stations  it  is  31.5. 


4.   DISCUSSION  OF  AREAL  DISTORTION 


The  Lambert  Conformal  Conic  projection,  which  w 
has  several  advantages  for  this  work,  but  as  with  a 
projections  it  exaggerates  or  reduces  relative  area 
part  to  another  of  the  map  (Robinson  and  Sale,  1969 
be  suggested  that  the  use  of  the  Thiessen  polygon  w 
projection  could  lead  to  errors  because  equal  areas 
earth's  surface  often  do  not  correspond  to  equal  ar 
map.  To  investigate  the  extent  of  this  problem  for 
considered,  it  was  decided  to  calculate  and  compare 
areas  in  the  north  and  the  south  of  Saskatchewan, 
were  used  because  of  the  ease  with  which  areas  can 
given  the  coordinates  of  their  vertices  either  in  s 
rectangular  coordinates. 
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5 .   CONCLUSIONS 

This  paper  describes  and  demonstrates  a  practical  system  for 
employing  readily  available  point  and  boundary  location 
information  to  derive  weighting  factors  for  use  in  computing 
areal  meteorological  data  from  station  data.   It  is  particularly 
effective  for  applications,  such  as  large  area  yield  prediction, 
where  a  number  of  bounded  districts  is  involved  and  there  are 
frequent  changes  in  the  station  set.   The  system  includes 
computer  mapping  procedures  based  on  SYMAP,  to  aid  the  user  in 
understanding  the  spatial  relationships  and  in  assessing  the 
suitability  of  the  station  distribution. 

The  study  confirms  the  effectiveness  of  the  grid  mesh 
technique  for  calculating  the  Thiessen  weighting  factors, 
especially  in  comparison  with  traditional  manual  techniques. 
The  computer  implementation  of  this  grid  mesh  technique  proved 
to  be  very  efficient  for  the  sample  application  considered 
here.   The  computer  time  requirements  depend  largely  on  the 
level  of  accuracy  required  and  the  resulting  number  of 
iterations. 

In  order  to  use  the  grid  mesh  technique,  the  station  and 
district  boundary  coordinates  must  be  available  in  a  compatible 
form.   The  system  presented  here  demonstrates  how  quite  standard 
source  data  can  be  reduced  to  the  required  form.   The  linear 
transformation  was  shown  to  be  effective  in  reducing  large 
amounts  of  data  from  different  X-Y  coordinates  system  to  a 
common  base . 

Usually  the  digitized  boundary  coordinate  set  should  include 
at  least  three  reference  points.   Where  a  previously  digitized 
set  that  lacks  such  reference  point  coordinates  is  being 
employed,  the  ability  of  the  authors'  system  to  specify  the 
transformation  using  only  two  reference  points  can  considerably 
reduce  the  effort  required. 

The  use  of  the  Lambert  conformal  conic  projection  with  the 
specified  standard  parallels  was  found  to  be  acceptable  for  area 
determinations  in  the  region  of  interest. 

The  source  map  and  computer  maps  can  be  any  convenient 
scale,  and  the  system  could  readily  be  adapted  for  use  with 
other  map  projections  for  which  the  computer  routines  for 
converting  from  latitude  and  longitude  were  available.   For 
example  the  Universal  Transverse  Mercator  (UTM)  projection, 
which  is  appropriate  for  detailed  studies  of  relatively  small 
areas  and  is  commonly  used  in  topographic  maps,  provides  a 
rectangular  grid  system.   These  UTM  grid  coordinates  can  be 
input  for  SYMAP  mapping  (Williams,  McKenzie  and  Sheppard, 
1980).   They  could  also  be  used  directly  in  the  Thiessen  grid 
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mesh  program,  and  the  linear  transformation  described  here  could 
readily  be  employed  to  reduce  boundary  coordinates  digitized  on 
a  UTM  map  to  the  UTM  grid  system. 
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The  system  developed  by  the  authors  could  be  employed  in 
ographic  data  structure  analyses.   The  districts  themselves 
polygons,  so  the  situation  (Fig.  4)  is  one  of  overlapping 
gons.   Peucker  and  Chrisman  (1975)  suggest  that  in  such 
s  attention  be  directed  to  the  "Least  Common  Geographic 
s"  (LCGU's),  i.e.  the  areal  units  uncut  by  any  lines.   These 
's  are  here  the  basis  of  the  Thiessen  weighting  factors, 
example,  there  are  two  LCGU's  in  District  4a,  and  the  ratio 
heir  areas,  5858:4142,  indicates  the  relative  weights  to  be 
n  to  data  from  stations  2  and  19  in  computing  areal  values 
4a  (Fig.  4  and  Table  3). 
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